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1. Introduction 

Single molecule force-measuring techniques have made possible the controlled manipu- 
lation of individual molecules by applying forces on the piconewton scale. Current force 
measuring devices include the atomic force microscope, optical and magnetic tweezers, 
or even microneedles and biological membranes used as force probes. Single molecule 
manipulation has been used to investigate many problems in molecular and cellular 
biophysics (see [1,2] for recent reviews). To cite just a few examples: the mechanical 
properties of biopolymers such as DNA [3-5] have been established for the first time; 
the folding/unfolding processes in individual RNA or protein molecules [6-8] have been 
investigated; the interactions between DNA and proteins [9], but also between DNA 
and RNA, have been studied at the molecular level [10]; the motion of single molecular 
motors has also been followed in real time [11-15]. 

An important aspect of single molecule force experiments is the possibility to 
monitor the time evolution of individual molecules by recording the molecular extension. 
Measuring forces and extensions as functions of time provides a lot of information about 
thermodynamics and kinetics of individual molecules. A very useful technique to achieve 
this goal are optical tweezers, which are suited to accurately measure forces in the 
range 0.1-100 pN. Using optical tweezers, it is possible to derive the free energies of 
formation of biomolecules with good accuracy. At the same time, optical tweezers allow 
us to investigate questions related to the kinetics of folding, a challenging problem in 
biophysics and statistical mechanics. A physical quantity useful to characterize the 
behaviour of complex systems is the free energy landscape. The free energy landscape 
describes the energetics of the configurational space of a system. Introduced and applied 
in the context of disordered and glassy systems, this concept finds a major application 
in small systems, where thermal fluctuations entail large conformational fiuctations and 
the system can explore a large portion of the configurational space. 

In this paper we investigate the thermodynamics and kinetics of force induced 
folding/unfolding (hereafter referred as F/U) of short DNA hairpins. Investigating 
force kinetics of DNA hairpins presents several advantages [16,17] over other molecular 
constructs. In particular, DNA sequences can be synthesized with relative ease and 
DNA degrades comparatively less than other molecules (e.g. RNA) do. We carry 
out single DNA pulling experiments using optical tweezers to extract information from 
thermodynamics and kinetics under the application of an external force. The results are 
compared with theoretical predictions based on the concept of the free energy landscape. 
The paper is divided as follows. After a brief summary of the type of experiments in 
section [21 we elaborate on the concept of free energy landscape applied to nucleic acid 
hairpins (DNA or RNA) in section [31 Section [H explains how to extract information 
about thermodynamics and kinetics from pulling data. An analysis of the kinetic 
parameters extracted from our experiments is presented in section [5] Section [6] shows an 
alternative method to derive free energy differences using fluctuation relations. Finally, 
in section [7] we discuss how to extract the free energy of formation of the hairpin at zero 
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force, both from kinetics and fluctuation relations. After the conclusive section, three 
appendices supplement the results of this paper detailing some technical subjects. 

2. Mechanical unfolding of DNA hairpins 

Our experimental setup is shown in figure [I](a). The DNA hairpin is tethered between 
two beads by using double-stranded DNA (dsDNA) handles. Experiments are carried 
out in a newly designed miniaturized dual-beam laser optical tweezers apparatus [18]. 
One bead is immobilized in the tip of a micropipette that is solidary with the 
fiuidics chamber, the other bead is captured in an optical trap generated by two 
counterpropagating laser beams [19]. The force acting on the bead can be directly 
measured from the change in light momentum deflected by the bead. A steerable optical 
trap can be moved up and down along the vertical axis so to repeatedly unfold/refold 
the molecule. Every pulling cycle consists of a stretching process (hereafter referred as 
S) and a releasing (hereafter referred as R) process. In the stretching part of the cycle 
the molecule is stretched from a minimum value of the force (/min ~ 10 pN), so small 
that the hairpin is always folded, up to a maximum value of the force (/max ~ 20 pN), so 
large that the hairpin is always unfolded. During the releasing part of the cycle the force 
is decreased from /^ax back to /min- The force is varied at the same loading rate (the 
rate at which the force is increased or decreased) in both stretching and releasing stages 
of the cycl^, and recorded with an acquisition frequency of 1 kHz. All experiments 
were done at a temperature 23° — 24° C in a IM NaCl aqueous buffer with neutral pH 
(7.5) stabilized by Tris HCl and IM EDTA. 

The molecular construct is shown in figure [U^b) and consists of a DNA hairpin of 
21 base pairs (bps) ending with a tetraloop GAAA. The hairpin is inserted between two 
identical short dsDNA handles of 29 bps each. The sequence of this DNA hairpin is 
canonical (i.e. all base pairs are complementary) and has been specifically designed to 
produce a two-state folder (see below in section [3]). 

Force-distance curves (FDCs) in our experiments represent the force acting on the 
molecule as a function of the relative position of the trap along the force axis. From the 
FDC it is possible to extract the molecular extension, so to represent the force versus the 
molecular extension in what is known as a force-extension curve (EEC). Many works in 
single molecule manipulation often use such representation. However, in many aspects 
it is better to use the trap position rather than the molecular extension to draw pulling 
curves. In fact, the former is the only parameter that is externally controlled (referred to 
as the control parameter [20]) whereas the latter is subject to fluctuations that introduce 
additional (albeit small) corrections (e.g. in the measure of the mechanical work exerted 

§ This process is performed at constant pulling speed v. Since the elasticity of DNA (dsDNA handles 
and unfolded DNA hairpin) is force-dependent, strictly speaking the loading/unloading rate r is not 
constant throughout the pulling process. Nevertheless, in the force range of our experiment the rigidity 
of the optical trap fcb is much smaller than the stiffness of the handles and the unfolded DNA hairpin. 
Therefore, the effective rigidity ()3ip is kcs ~ fcb, and our system verifies r = vkcs ~ vk^. 
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Figure 1. (a) Experimental set up. (b) DNA hairpin sequence. The 5' and 3' labels 
indicate the polarity of the phosphate chain of the hairpin. 



upon the molecule). Although either description contain the same information, we will 
stick to the FDC picture throughout this paper. 

Figure [SJl^a) shows some typical FDCs for the sequence under study at three different 
loading rates (slow and fast). The FDC shows a linear dependence of the force versus 
distance as a consequence of our choice of short handles. In fact, because the handles 
are very rigid (when compared to the rigidity of the trap) the effective rigidity of the 
system made of bead and handles is mostly determined by the constant rigidity of the 
Hookean (i.e. linear) optical trap. Force rips are generated each time the molecule 
folds/unfolds: after unfolding, a segment of 46 nucleotides of ssDNA is released so the 
force suddenly drops as the trapped bead relaxes toward the center of the optical trap; 
whereas after refolding the closure of the DNA hairpin pulls the bead away from the 
trap and the force increases. 

When pulling at slow loading rates the molecule shows low hysteresis in the value of 
the unfolding/refolding force. Moreover, the hairpin can execute several F/U transitions 
during the stretching and releasing stages of the cycle. In contrast, when pulling at fast 
loading rates, the molecule shows larger hysteresis in the value of the unfolding/refolding 
force, and multiple F/U transitions are unlikely. How many transitions are observed and 
how much irreversible are the stretching and releasing processes depends on how large 
is the pulling rate r compared to the typical folding/unfolding rate [48]. 



Dynamic force spectroscopy of DNA hairpins. I. 



5 



3. The free energy landscape 

To follow the dynamics of the folding/unfolding process in configuration space is a 
formidable task, due to the large number of interacting degrees of freedom. However, 
the collective behaviour displayed by the F/U transition suggests a simplified approach: 
one can judiciously choose one collective reaction coordinate and project the many- 
dimensional energy landscape (where we represent the energy of each microscopic 
configuration) onto a one-dimensional free energy profile, where each point stands for 
an ensemble of configurations. One minimum of the free energy profile represents the 
energetically favored folded state, another one the entropically favored unfolded state. 
In between, we find information about how much energy is needed (in the form either 
of heat or work) to explore intermediate states of the system. In this section we will 
focus on the case where pressure and temperature are constant, the applied force is the 
control parameter, while other variables such as the molecular extension or the position 
of the trap fiuctuate. The effect of the experimental setup on the free energy landscape 



is discussed in Appendix A and Appendix C 



As reaction coordinate, we choose the number of open base-pairs n. If is the 
number of base-pairs in the stem of the hairpin, and L is the number of bases in the 
loop (so that the total number of bases in the hairpin is 2N + L), the configuration with 
n = is the folded state, while the configuration with n = is the completely unfolded 
state. In the absence of an externally applied force, we will use the symbol G^{n) to 
denote the free energy to be delivered to the molecule (in the form of heat) in order 
to break the first n base pairs. G^{n) can be measured in bulk experiments, e.g. by 
calorimetry or UV absorbance: by melting oligonucleotides of different lengths we can 
enforce the desired number n of dissociated base pairs. The free energies of dissociation 
of the different nearest neighbour base pairs [21] as well as other different secondary 
structural elements (e.g. base-pair mismatches, loops, etc.) are used by Mfold [23] to 
extract the free energy of formation of the DNA molecule. In general, G^{n) will be a 
monotonically increasing function of n whenever base pairs dissociate; however, it may 
decrease in the presence of entropic structural elements such as loops [22]. 



3.1. The effect of force 

What is the effect of an externally applied force / on the free energy landscape G^{n)l To 
answer this question, we introduce the force- dependent free energy landscape G{xn, /), 
where Xn is the distance between the 3' and the 5' extremities of the hairpin, measured 
along the direction of the applied force [24,25]. We find it convenient to express the free 
energy in terms of Xn because this quantity, unlike n, is experimentally accessible. 

In the folded state n = 0, in the presence of an external force / 7^ 0, the hairpin is 
always oriented along the force axis, therefore Xq is equal to the diameter of the hairpin 
do (typically about 2 nm). If / = 0, then the hairpin is generally not aligned, and 
all we can say is that Xq < d^. For any intermediate state < n < A^, is equal 
to do, plus the equilibrium extension x'"(/) of a ssDNA molecule with contour length 
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Figure 2. (a) Schematic picture of the different configurations of a hairpin with A'' 
base pairs at force /. For all configurations where the molecule is not completely 
unfolded, the molecular extension is equal to a;'" + do where a;'" is the extension at 
force / of the released 2n bases of ssDNA. (b) Schematic picture of the free energy 
landscape at different forces /. The most important parameters are the distances 
from the transition state (TS) to the folded (F) and unfolded (U) states: and x^^ , 
respectively; the free energy difference AG{f) and the barrier B{f). As force increases 
the free energy landscape is tilted down favouring configurations of large extension Xn ■ 



In = '2nd (this relation can change in the presence of structural motifs such as base-pair 
mismatches) if d is the inter-base distance. The length is found by inverting the 

thermodynamic force-extension curve 

/ = i^sDNA(^) • (1) 

The explicit form of ([1]) depends on the particular model assumed (the most popular 
being the freely-jointed chain and the worm-like chain). In the totally unfolded state 
n = N, Xn is the equilibrium extension a;'^(/) of a ssDNA molecule with contour length 
In = {2N + L)d. In synthesis, 

Xn{f) = {l-6n,N)do+x'-{f), In = 2nd + 6n,NLd . (2) 

Note that in ([T]) we assume thermodynamic equilibrium for the ssDNA. It would be 
possible to include elastic free energy fluctuations in the stretching part of the free 
energy corresponding to the ssDNA to substitute the discrete x„ with a continuous 
variable x, but this would not change much the predictions of the model. 
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In the presence of an external force the free energy landscape G{xn, f) is tilted 
along the reaction coordinate: 

G(x„, /) = G\n) + GssdnaIO x'" ; /) - fxn , (3) 

where we have introduced 

Gssdna(0 ^ x'"; /) = / F4'bNA(y)dy , (4) 

Jo 

that is the reversible work needed to stretch a segment of ssDNA of contour length In 
from an initial extension equal to until a final extension (see figure [2]^a)), or, in 

other words, the area under the force-extension curve ([T]) between and x''". 

Using (jll), after a change of integration variable, we can rewrite the free energy 
landscape ([3]) as 

G{Xn, f) = G\n) - Xn{f')df' + 6n,Nfdo . (5) 

Jo 

From the free energy landscape ([3]) we can define two important force dependent 
parameters. The first is the free energy difference between the folded and the unfolded 

St cites 5 

AG{f) = GixM, f) - G(xo, /) = AGo + GssdnaIO ^ x'-; /) - - do)f , (6) 

where AGq = AG{f = 0) = G^{N) — 6**^(0) denotes the free energy of formation of the 
hairpin at zero force. In what follows we will simplify the notation by defining 

x^{f)=x'-{f)-do. (7) 

Xjn depends on the force / and is equal to the released molecular extension when the 
hairpin unzips completely. In this notation we have 

AG{f) = G{xN, f) - ^(xo, /) = AGo + Gssdna(0 ^ x'-; /) - /x^ . (8) 

The other important parameter is the free energy barrier, equal to the free energy 
difference between the transition and the folded states, 

B{f) = G(x^ /) - G(xo, f) = B, + Gssdna(0 ^ x'- /) - /x^ , (9) 

where x^ is defined as the value of x„, where G(x„, /) is maximum, Bq = G^{n^) — G°(0) 
is the free energy barrier at zero force, is the number of base pairs released at the 
transition state at force /, and Ip = /„f. 

We stress that the parameters AG{f) and B{f), as determined from the shape of 
the free energy landscape G{xn,f), depend directly on the applied force. Moreover, 
the relevant distances Xm and x^ are also expected to depend on the force due to the 
dependence of the molecular extension of the ssDNA with force, as shown in ([T]). Yet, 
such dependence is expected to be very small for forces in the vicinity of the F/U 
transition (see below). In what follows, and in order to lighten the notation, we will not 
indicate explicitly such force dependence for these distances. For later use we define the 
following quantities: 

AGi(/) = AGo + Gssdna(0 ^ x'-; /) =^ AG{f) = AGi(/) - /x„ (10) 
B^if) =5o + Gssdna(0^x'^;/) ^ B{f) = B,{f) - fx^ (11) 
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As we will see below, AGi, Xm and can be directly measured in pulling experiments. 

In the previous computation, we have neglected the work needed to orient the 
hairpin along the force axis. By treating the elastic response of the folded hairpin as a 
polymer of contour length equal to the hairpin diameter do and persistence length P, 
the work necessary to orient the hairpin along the force axis is inversely proportional 
to P. For a rigid object P is large, so the free energy of orienting the hairpin is indeed 
expected to be negligible. Not so the effect of the hairpin diameter, as showed in ([5]). 



3.2. Kramers-Bell kinetic rates 

According to the Kramers-Bell theory [26], the kinetic rates of unfolding and folding 
under tension in a two-state folder are given by 

[Gix^jy^ 



k-,{f) = koexp 



k^{f) = koexp 



knT 



-G{x^,f) + G{xoJ) + G{x''jy 
knT 



12a 



;i26) 



The k-t{f) {k^{f)) describes the kinetic rate to jump over the transition state from the 
folded (unfolded) state, ko stands for the attempt frequency of the hairpin (which may 
get contributions from the instrument and the whole molecular construct), /cb and T 
being respectively the Boltzmann constant and the temperature of the bath. The rates 
( I12ap and (\12b\i satisfy detailed balance, 
k^if) G{xNj)-G{xojy 



exp 



exp 



G'ssdna(0 



X^^-J)- fXr_ 



exp 



AG(/)\ 



(13) 



ksT J \ ksT J 

where AG{f) has been defined in ([8]). The coexistence force fc is defined as the value 
of the force at which the equilibrium constant is equal to one, 

k, = k^if,) = k^if,), (14) 

kc being the coexistence rate. According to ( fT3l) . this corresponds to AG{fc) = 0, or 

AGi(/c) = AGo + G,,dna(0 ^ x'^; f,) = f,x^ . (15) 

Summarizing, the kinetic rates f ll2ap and (112 61) can be rewritten in a more compact 
form, 

fB^{f)-fx^'- 



kAf) 
k^f) 



ko exp 
ko exp 



' B,{f)-AG,{f)+ fx 
ksT 



(16a) 
(166) 



where 



x^ = x^ + a;U^ (17) 

is equal to the change in molecular extension across the F/U transition. The terms 
AGi(/), Bi{f) are given in (ITOl) and (fTTj) . while each stretching contribution (contained 
in those terms) of the type Gssdna(0 — > x^{f)) has been defined in (jll) and ([I]). 
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Figure 3. Free energy landscape at various forces for the hairpin sequence shown in 
figurc[ljb). The stretching contribution has been calculated from Mfold using ((40|) and 
(|4T|l (see below in section[7]). Moreover, the effect of the diameter has been included 
in the calculation. The values shown for the distances , x^^ are approximately 
1 nm larger than the values found in our experiments. However, the value for the 
fragility /i (|20|) inferred from these distances (0.1) is compatible with that found in the 
experiments (0.0975, see table [I]). 



3.3. Simplified version of the rates 

In a further simplified description [27], it is common to neglect any force dependence for 
^Gi{f) and Bi{f) in fll6ap and (116&I) . so to incorporate these quantitites into effective 
values for the barrier Bi and the free energy difference AGi measured at the transition 
force fc. This means that AGi(/) ^ AGi(/c) and — -Bi(/c) if / is not too far from 

fc- This approximation is justified because the elastic contributions contained in those 
terms vary much less with force than the products fx^, fx^^ do. A mathematical proof 



of this result is shown in Appendix B In what follows we will drop any force dependence 



in the effective parameters Bi and AGi and write the kinetic rates as follows: 

k^f) = k^exp (^^^ , k^if) = k^exp (^^^^^/^) > (18) 

where corresponds to the unfolding rate at zero force and is given by 

km = koexp (--^\ (19) 



kBT. 

As we will see in section |U the simplified rates f|T8|) reproduce reasonably well the 
experimental results. 

The success of the two-states model in reproducing the kinetics of the force induced 
F/U reactions depends on the hairpin sequence. In the absence of structural motifs that 
may induce alternative F/U pathways or intermediate states (see for example [28]), a 
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short hairpin (a few tens of base pairs) will display cooperative two-states behaviour. 
In addition, if the sequence is designed in such a way that the free energy landscape 
along the reaction coordinate has a single barrier, then such molecule is expected to 
behave as a two-state folder displaying simple Arrhenius kinetics. In this work we have 
designed a DNA sequence (see figure [It^b)) that has such properties. The free energy 
landscape as function of the molecular extension has been calculated at various forces 
using the free energy values from Mfold and the elastic properties of ssDNA. In figure [3] 
we show the calculated free energy landscape at various forces around the coexistence 
force /c — 17.9 pN, where the free energies of the folded and unfolded states are equal. 



3.4- The fragility 

An important aspect of the Kramers-Bell rates f|T8l) is their strong dependence on force 
which is determined by the values of and x^^ . If x^ ^ x^^ , then the transition 
state is located far away from the folded state and the molecule deforms considerably 
before it unfolds. In the other case, when x^^ ^ x^ , the transition state is located close 
to the folded state and the molecule unfolds without deforming much. A quantitative 
measure of how much the native structure deforms before unfolding occurs is given by 
the fragility parameter [29-32] 

^ = , ^UF = —: • (20) 

H lies in the range [—1 : 1] and defines the degree of compliance of the molecule under 
the effect of tension. Fragile or compliant molecules are those in which x^ is larger than 
x^^ and fi is positive. In contrast, when x^^ is larger than x^ and fi is negative, we 
talk about brittle structures. The fragility has been proved to be a useful parameter 
to describe the mechanical unfolding of RNA hairpins with more than one transition 
state [32]. 



4. Breakage force kinetics 

In order to manipulate a DNA hairpin using optical tweezers, the free ends of the 
molecule are attached to micron-sized beads by using dsDNA handles. In this 
experimental configuration (see figure [H^a)), the force fluctuates and the control 
parameter is the extension between the center of the trap and the tip of the micropipette. 
This experimental setup corresponds to the so called mixed ensemble. It is then possible 
to describe the F/U kinetics of the DNA hairpin using the two-state model depicted in 
figure [21 taking into account that the variable that controls the shape of the free energy 
landscape is the trap-pipette distance rather than the force. The mixed ensemble gets 
contributions from the different elements of the experimental setup. In their simplified 
form the kinetic rates in the mixed ensemble can be shown to obey ( |T8l) with identical 
force dependent terms in the exponent (i.e. equal values for AGi,x^ ,x^^) but different 
prefactors (see Appendix C in [33]). 
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Distance X (nm) 

Figure 4. A representative FDC showing the stretching (black) and releasing (red) 
parts of a pulling cycle. It shows the first rupture force during stretching (/g) and 
releasing (/p) processes. 



The simplest way to extract the values of AGi, x^, x^^ from the pulling data is to 
analyze the distribution of first rupture forces along stretching and releasing parts of 
the cycle, /g and The first rupture force /g^j^^ along the stretching (releasing) part 
of the cycle is the value of the force at which the first force rip is observed at the time 
where the first jump occurs. An illustration is shown in figure HI Useful quantities that 
can be measured in pulling experiments are: the survival probability -Ps{R)(/) that the 
molecule remains in the folded (unfolded) state along the stretching (releasing) process 
until reaching the force /, the mean value and the variance of the first rupture forces 

/s(R) ■ 

Survival probability. The distribution Ps{R)(/) satisfies the following master equation: 

^^SM^A . -L,_,(/(i))Ps,R,(/(i)) . (21) 

Pg(R)(/) is related to the experimentally measured distribution of first rupture forces, 
Ps(R)(/), by 

^s(/) = 1 - Ps(/')d/', Pr(/) = 1 - f^^^ PkU'W. (22) 

•'/mill f 

where /min (/max) is the initial (final) force along the stretching-releasing cycles. For a 
protocol at a constant loading/unloading rate r, -Ps{r)(/) can be exactly computed: 

(k^T 



Psif) = exp 



rx^ 



(k^if) - k^ifmin)) 



(23 a) 
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Figure 5. (a) Force-distance curves (FDCs) corresponding to 5 cycles at fast and slow 
pulling speeds (200 nm/s and 25 nm/s respectively, corresponding to loading rates of 
8.1 and 1.0 pN/s). Continuous red (dashed orange) lines correspond to fast stretching 
(releasing) parts of the cycle. Continuous blue (dashed cyan) lines correspond to slow 
stretching (releasing) parts of the cycle. Note that hysteresis increases with the pulling 
speed. Moreover, FDCs at slow speeds show several transitions between the folded and 
the unfolded states, (b) First rupture force distributions for the stretching (continuous 
lines) and releasing process (dashed lines) for three molecules at three different loading 
rates r = 1.76 pN/s (blue), r = 4.8 pN/s (green), r = 14.5 pN/s (red). 



PrU) = exp 



{k^{f)-k^{f^.,^)) 



(235) 



where we used the unfolding (folding) rates given in (ITSl) [34]. In the limit f^nin <^ fc (S 
process) and /max < fc (R process), the function log[-r(log(Ps{R)(/))] is given by 

log[-r(log(Ps(/))] = log (^^] + (^] f , (24a) 




log[-r(log(PR(/))] = log + - 7-7^ / • (245) 



These results show that the function log[—r(log(Ps(R) (/))], plotted as a function of the 
applied force /, is a straight line with a slope inversely proportional to the position of 
the kinetic barrier [x^^) and intercepts (as, a^) related to the rate and the free 
energy difference AGi [34]. To extract the value of AGi, we do linear fits to (124 gp and 
(124 5p . then use the independent coefficients as,aR to obtain 

||..,_.,H-log(^). (25) 

In order to extract Bi we can use the relation (1191) . Bi = —k^Tloglk^/ko) where 
can be extracted from the value of as but ko is unknown. In order to determine Bi 
one needs to make further assumptions. For example, it is possible to use effective 
one-dimensional Kramers models applied to molecular free energy landscapes [32,35,36] 
to infer the value of Pi. Figure [6] shows an experimental test of (124 gp . (124 &p using the 
results shown in figure O^b). 
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13 14 15 16 17 18 19 20 21 22 
Force (pN) 

Figure 6. Experimental data for log(— r log(P(R gj)) plotted as function of force 
can be fit to straight lines according to (|24a|) . (j246|) . The slopes of the lines give the 
distances = 9.8 nm, x^^ = 8.1 nm (see table [Ij. Data correspond to three different 
molecules pulled at different loading rates r ~ 1.76 pN/s (blue), r ~ 4.8 pN/s (green), 
r = 14.5 pN/s (red). Circles correspond to the stretching process. Squares correspond 
to the releasing process. Note that the value of the force at which folding and unfolding 
lines cross each other (~ 17.5 pN) lies pretty close to the average coexistence force, 
17.75 pN (see table [I]). 



Mean and variance of first rupture forces. The dependence on the rate r of the mean 
value and the standard deviations of /| and can be computed in the relevant 
experimental regime, that is when 

(/c being the coexistence force) [37]. The result is 

(/s) = ^[C + log(r)] + 0(a) , (cx;.) = ^ + 0(a) , (27) 

(/r) = - log(^)] + 0(a) , (a;.) = ^ + 0(a) , (28) 

where (o"/*), (cf*) denote the rms deviation of the first rupture forces. The constants 
C and C depend on the characteristics of the molecule and on the initial force values 
along the stretching and releasing processes respectively. Equation fl24ap . fl246|) and 
( !27|) . (128!) provide a way to extract the relevant parameters that characterize the free 
energy landscape from the experimental data. A test of the validity of ( l27j) . (l28l) is 
shown in figure [3 
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Figure 7. Average and rms deviation of first rupture forces as a function of the 
loading rate. Data have been taken by averaging over different molecules at different 
loading rates (see table [T|). The average rupture force (left panel) has been fit to (|27|) 
fixing the values = 9.8 nm, x^^ = 8.1 nm (table[l|). We get C = 43.64, C = 34.35 
for the best fit. The rms of the rupture force (right panel) is constant with the loading 
rate (l28l) but cannot be fit to (l28l) with the values of x^ 



obtained from the survival 
probability analysis. Instead we get x^ = 7.74 nm, x^^ = 6.34 nm (again compatible 
with fi = 0.1), which however are 30% smaller than the expected values. 



5. Kinetic parameters for the hairpin 

In this section we use the aheady cited experimental data to extract the parameters 
that characterize the free energy landscape and the kinetics of the hairpin. A summary 
of the results obtained by analyzing data for 11 molecules pulled at different speeds is 
shown in table [H We have extracted the different values of the parameters for each 
molecule to find the mean and the standard deviation. The values x^,x^^ have been 
extracted from the linear fits (124 gp . ( 12461) . whereas Xm and fi are given by (|T7l).(l20l). For 



each molecule the value of AGi has been obtained from (125|) whereas the coexistence 
force fc and the coexistence rate kc are extracted from ( ITSl) . IHM and (ITSll : 

fc = , = k^exp (^^^ . (29) 

To complement such estimates we also show another estimate (/c) for the average 
coexistence force which corresponds to the average first rupture forces along the 
stretching and release parts of the cycle: 

Tc = lGlTm- (30) 

In addition, we also verify that the product of the coexistence force fc ( l30!) times the 
extension ged over all molecules is compatible with the average value of AGi. 
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Table 1. Mean and standard deviations of different parameters obtained from the 
kinetic experiments. Averages are taken over 11 different molecules that have been 
pulled at different puUing speeds: 18.5 nm/s or 1 pN/s (1 molecule, 193 cycles), 36.5 
nm/s or 1.95 pN/s (2 molecules, 160 cycles), 86.2 nm/s or 4.88 pN/s (3 molecules, 570 
cycles), 156 nm/s or 8.1 pN/s (2 molecules, 725 cycles), 274.3 nm/s or 14.9 pN/s (3 
molecules, 1580 cycles). 



(nm) 


x^^ (nm) 


Xm (nm) 


M 


/c (pN) 


fc (pN) 


9.87(36) 


8.13(33) 


18.06(52) 


0.097(23) 


17.91(09) 


17.75(08) 


AGi (fefiT) 


/c • 3:^111 {kBT) 


fee (Hz) 


A/ (pN) 


fccff (Hz) 


Aa; (nm) 


78.7(2.4) 


78.0(2.3) 


0.58(08) 


1.230(47) 


0.0544(10) 


22.6(1.2) 



Finally, we also show three more quantitites: 1) the average force jump across the 
transition, A/; 2) the average slope of the FDC corresponding to the combined stiffness 
kes of bead and handles [38,39], 

^ - ^ + ^ (31) 

and 3) the average retraction of the combined extension of the bead and handles (xb+Xh) 
induced by the force change, Ax = Af /kcs- 

From table [Hit emerges a remarkable fact: the expected molecular extension Xm at 
fc = 17.91pN is 3 nm smaller than the change in extension of combined system formed 
by bead and handles, which is around 21.30 nm if we subtract to the total extension 
of the ssDNA (around 23.30 nm, see below in table [2]) the diameter of the hairpin 
do = 2 nm. Is this expected? What is the real average change in molecular extension 
of the hairpin across the F/U transition? The net change in molecular extension across 
the transition can be estimated from the average retraction experienced by bead and 
handles. Ax = Af /kes- 

Assuming that handles have a large but finite stiffness (around 500 pN/yum), part 
of the retraction (~ 10% of the total released molecular extension) might be accounted 
for by the retraction of the handles. This makes Ax ~ 22.6 (table [1]) an upper bound 
to the released molecular extension by the ssDNA. 

An important feature of our experiments is the variability observed in the values 
measured for different molecules. In figure M we show histograms of values obtained for 
a given set of parameters. Although distances x^,x^^,Xm and the free energy value, 
AGi, typically show a 15% variation around the average value, other quantitites like /i 
or kc show a more large variability. 

Is it possible to infer the value AGo from the reported value for AGi? The inclusion 
of the stretching contributions in order to infer the value of AGq is discussed below in 
section [71 
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Figure 8. Histograms of some of tlie parameters reported in table [T] Statistics is 
collected over 11 molecules pulled at different spedds (see caption of table [1] for more 
details). The vertical dashed hnes (cyan color) show the mean of the distribution as 
given in table [TJ 



6. Free energy recovery 

Alternative methods to extract the free energy difference AGi are provided by 
fluctuation relations. Fluctuation relations are symmetry identities that relate the 
probability of a system to absorb or release a given amount of energy to the environment 
along irreversible processes. In our pulling experiments, single molecules are in 
a transient nonequilibrium state, as revealed by the systematic hysteresis observed 
between the stretch and release processes at forces around the coexistence region. When 
the trap is moved fast enough, then the free energy landscape is modified too quick and 
the molecule cannot populate the different states (folded and unfolded) according to the 
Boltzmann-Gibbs weight. Under equilibrium conditions, (|T3|) predicts 

. . exp {-^) . (32, 

Pf(/) k^if) V ^bT J 

Hysteresis effects indicate that measured populations of folded and unfolded molecules 
PvF{f),PF{f), averaged over many pulling cycles, will not satisfy (|32|) . This precludes 
the validity of thermodynamic equilibrium in our pulling experiments. Note that 
mechanical equilibrium is probably satisfied at the experimentally accessible pulling 
speeds, as revealed by the fact that all other relaxational timescales in the system 
(beads, handles and ssDNA) are much shorter than the F/U timescale [40-42]. 
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6.1. The Crooks fluctuation relation 

Let Vrs{R) denote the mechanical work exerted on the molecule by moving the optical 
trap. This is given by 

dXFs(R)(X), (33) 

where the subindex S (R) refer to the stretching (releasing) stage of the cycle. In 
what follows, we will take Ws and PVr as positive and negative quantities respectively, 
although, strictly speaking and according to fl33|) . both have positive signs. In fact, 
during the stretching (releasing) parts of the cycle the optical trap delivers (extracts) 
mechanical work to (from) the molecule. During the stretching part of the cycle the 
work is positive {dX > 0) whereas during the releasing part of the cycle it is negative 
{dX < 0). By repeatedly pulling the molecule many times we can measure the stretching 
and releasing work distributions, 

Ps{W) = {6{W - Ws)) , Pk{W) = {6{W - Wr)) . (34) 

where (..) stands for the average over trajectories. For an infinite number of pulls, the 
Crooks fluctuation relation [43] establishes that the probability distributions (134|) satisfy 
the following identity: 



This relation provides an experimental way to extract the value of AGx2Z 
measurements of the irreversible work. In particular, the two distributions Ps{W) and 
Pr{—W) cross each other at the reversible work value, W = AGx^Z^ thus providing 
a method to derive the free energy difference between the initial and final states. The 
reversible work AG^™!"' gets contributions from pulling the bead and stretching the 



handles and the ssDNA (see Appendix C ). In particular, we have defined AGi(/max) 
in (IC.Sp as that part of the reversible work that gets contributions only from unfolding 
and stretching the hairpin, 

AGi(/„,,.) = AG^Z:: - 7^ [(/max)' - (/min)'] • (36) 

In what follows we subtract for each molecule from the value of the work W the term 



^ [(/max)' — (/min)'] to directly estimate the contribution to the free energy of the 
hairpin, AGi(/max)- The value of AGi(/max) can be derived by looking at the crossing 
of the stretching and releasing work distributions. Although stretching and releasing 
work distributions for the same molecule taken at different speeds cross at a common 
value AGi(/jnax) (data not shown), work histograms mostly change from molecule to 
molecule revealing some variability in our estimates for AGi(/max)- This is probably 
due to the fact that we are using very short tethers as handles which, due to their large 
rigidity and depending on the angle formed by the tether connecting the two beads, 
introduce a high variability to the free energy correction [(/max)' — (/min)']- 



2k. 
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Figure 9. (Left) Typical work distributions for three molecules at three different 
loading rates: 1 pN/s (slow, blue), 4.88 pN/s (medium, green), 14.9 pN/s (fast, red). 
Work values have been shifted in such a way that the crossing between the stretching 
and releasing work distributions is observed at the average value AG™ = 80.94^3^ 
(see table [3]). The vertical lines show the range of experimental error estimated for 
the value of AG™. (Right) Bennett aceptance ratio method to extract the value of 
^Gi{fmax) for all 11 molecules (shown as black dots). 



In figure [9] (left panel) we sfiow work distributions for tfiree different molecules 
pulled at different speeds. To better show the systematic dependence of work 
distributions on the pulling speed, the histograms shown in figure [9] (left panel) have 
been shifted to make the crossing point (between stretching and releasing distributions) 
coincide with the value of AGi(/niax) averaged over all molecules (see below and table [3]). 

In order to validate the fluctuation relation (l35l) and extract the value of AGi (/max) 
for each molecule we have analyzed data in two ways: 

Bennet acceptance ratio method. The details of this method have been described 
elsewhere [20,44]. In a nutshell, Bennett's method consists in defining two functions, 

zs{u) = (guiW)exp (-^)) , (37a) 



zn{u) = log i{guiW))s) , (376) 
where gu{W) is an arbitrary real function that depends on a parameter u and the average 
(. . .)s(R) is taken over the stretching (releasing) process. From (135|1 . it can be proven 
that 

zM-zs{u) = —^, (38) 

showing that the difference between the two functions is constant over u and equal to 
the reversible work. It has been shown by Bennett [45] that the optimal (i.e. minimal 
variance) estimate of AGx'^Z given by 

^"'^'^'- l + ^expf^r '''' 
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with u = AGx^^^- iVs(R) stands for the number of pulls along the stretching (releasing) 
process. The same result has been obtained by Pande and coworkers by using maximum 
likelihood methods [46]. In figure M (right panel) we show the test applied to work 
data for all molecules. The experimental data for zr(m) — zs{u) is approximately 
constant for each molecule over a wide range of u. The best estimate for AGi(/max) in 
fl36|) is obtained by looking at the intersection of the experimental data with the line 
^R(yu) — zs{u) = u/k^T (black dots in the figure). As we can see the fluctuation relation 
is validated for each molecule. However there is a strong variability from molecule to 
molecule for the values AGi(/jnax)- 



Direct representation of the probability ratio. The validity of the fluctuation relation 
fl35|) is again observed in figure [10] (left panel) where we have plot the ratio in the l.h.s. 
of fl35|) in logarithmic scale versus the work W. Like we did with the work histograms 
shown in figure [9] (left panel), work values have been shifted for the estimate AGi(/max) 
to match the the average value AG^^ = 80.94 k-gT over all molecules (table [3]). Finally, 
in figure fTOl (right panel) we plot the histogram of values for AGi(/niax) obtained for 
different molecules. As we already saw for kinetics, the value of AG'i(/max) changes from 
molecule to molecule, yet the average value AGf^ = 80.94 kBT (table [3]) is compatible 
with the estimate obtained from kinetics AGi = 78.73 (section [5] and table [T]). In 
figure [To] (right panel) we also show the histogram of the different slopes of the fluctuation 
relation shown in the left panel. The fluctuation relation is reasonably well satisfied by 
the experimental data with an average slope of 0.93. However, if we weight the different 
slopes according to the number of pulls for each molecule we obtain 0.96, which is closer 
to the expected value of 1. 



7. Derivation of the value of AGo 

In the preceding sections we showed ways of extracting the values of AGi and AG^^^Z 
by using rupture force kinetics or the fluctuation relation. Now we face the problem of 
extracting the value of the free energy of formation of the hairpin at zero force, AGq, 
using both methods. As we saw in ([T5]) and ( 1C.9I) . and in order to extract AGo, we 
must subtract from the total energy AGi and AGx^'^^ the stretching contribution to 
the free energy, 

px'-N 

GssDna(0 ^x'--f)= Fl-^^^{y)dy , / = fI-^^^{xn) , (40) 

JO 

with f = fc in ([T5l) and / = /max in (1C.9|) . 1^ = {^N + L)d denotes the full contour 
length of the hairpin (section [3TT]) . For the elastic response of the ssDNA, -f^isDNAl^)' 
we use the freely-jointed chain model [3,47], 

fh \ k^T] 




coth 



(41) 



kBT) fh 

with / the contour length of the ssDNA, h the Kuhn length and Y the Young modulus. 
For our temperature and salt conditions we take h = 1.43 nm, Y = 812 pN [3,47]. 
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Figure 10. (Left) Experimental verification of the Crooks fluctuation relation (1351) . 
For each molecule, work data has been shifted to cross the horizontal axis (dashed line) 
at = AGi(/max) (corresponding to the black dots shown in figure [9l right panel). 
Different symbols correspond to the 11 molecules. The continuous line is a weighted 
linear fit to all data that has slope equal to 0.96. (Right) Histograms of the values of 
^Gi(/max) and the slopes corresponding to data shown in the left panel. Statistics 
collected over 11 molecules. The vertical dotted lines show the mean of the histogram 
whereas the dashed line in the histogram of AGi(/niax) indicates the average value of 
AGi that has been obtained from kinetics (section [5] and table [T]) . 



We must stress that in order to extract AGq from either rupture force kinetics data 
or by using the fluctuation relation, it is necessary to take into account the fact that the 
force is not controlled and adopt expressions derived in the appropriate experimental 
mixed ensemble. In what follows we consider both cases. 



Deriving AGq from rupture force kinetics. From the value obtained for AGi from 
rupture force kinetics (table [1]), we now adopt the expression ( |T5l) with AG'i(/c) = AGi 
where AGi has been measured from kinetics as explained in section [3731 However, using 
that expression would lead to incorrect results for AGq- The reason is that in our 
experiments the force is not controlled, as we only control the position of the trap. As 
we have shown in Appendix C a contraction in molecular extension (induced by the 



finite diameter of the hairpin) shifts the free energy of the fully unfolded state, relative 
to any partially unfolded intermediate state, by an amount equal to —fcd^. Therefore, 

AGo = AGi-GssDNA(0->x,v;/c) + /ct^o, /c = i"ssDNA(a;iv) . (42) 

Although the values obtained for AGq show the same dispersion as we saw for AGi in 
section [5] (see figure [8]), the average value of AGq is not far from the Mfold [23] predicted 
value (figure [H] and table [2]) . 



Deriving AGq from the fluctuation relation. We now use (]C.7p . 

AGo = AGi (/n.ax) - GssDNa(0 ^ X'^ ; /^ax) + /maxrfo , (43) 
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Table 2. Mean and standard deviations of free energy parameters obtained from the 
kinetic measurements. Averages are taken over 11 different molecules that have been 
pulled at different pulling speeds: 18.5 nm/s or 1 pN/s (1 molecule, 193 cycles), 36.5 
nm/s or 1.95 pN/s (2 molecules, 160 cycles), 86.2 nm/s or 4.88 pN/s (3 molecules, 570 
cycles), 156 nm/s or 8.1 pN/s (2 molecules, 725 cycles), 274.3 nm/s or 14.9 pN/s (3 
molecules, 1580 cycles). 



fc (pN) 


a^ssDNA (nm) 


do (nm) 


(AGs.dna)'^'" (kBT) 


17.91 


23.3 


2.0 


30.24 


(AGi)"^- (fesT) 


(AGo)"^'" (fceT) 


(AGo)'''" (kcal/mol) 


Mfold (kcal/mol) 


78.7(2.4) 


57.2(2.4) 


33.7(1.5) 


36.81 



Table 3. Mean and standard deviations of free energy parameters obtained from the 
Crooks fluctuation relation. Averages are taken over 11 different molecules that have 
been pulled at different pulling speeds: 18.5 nm/s or 1 pN/s (1 molecule, 193 cycles), 
36.5 nm/s or 1.95 pN/s (2 molecules, 160 cycles), 86.2 nm/s or 4.88 pN/s (3 molecules, 
570 cycles), 156 nm/s or 8.1 pN/s (2 molecules, 725 cycles), 274.3 nm/s or 14.9 pN/s 
(3 molecules, 1580 cycles). The value for A(/^/2fcoff) depends on the maximum and 
minimum force that varies for each molecule. Therefore we just show the average of 
this number without giving the error. 



a^ssDNA (nm) 


do (nm) 


(AG^sdna) (kBT) 


A(/V2fceff) (kBT) 


23.70(03) 


2.0 


32.40(04) 


341.35 


(AGi)™ (knT) 


(AGo)™ (kBT) 


(AGo)'^^ (kcal/mol) 


Mfold (kcal/mol) 


80.9(1.0) 


58.2(1.0) 


34.2(6) 


36.81 



with /max the maximum force along the force cycles. We have determined the value 
of AGo for each molecule using the previously determined values for AG'i(/max) in 
section 16.11 As we saw in figure [HI there is also some variability for the values of AGq 
obtained for different molecules using this method. However, the average of the different 
values is not far from the value expected from Mfold [23] (figure [TT] and table [2]). 

8. Conclusions 

In this work we have investigated the mechanical unfolding of DNA hairpins using 
optical tweezers. We have tested the validity of Kramers-Bell theory for force dependent 
folding/unfolding kinetic rates by comparing theoretical predictions to single molecule 
pulling experiments on DNA hairpins. We introduced the concept of free energy 
landscape for generic nucleic acid (DNA and RNA) molecules and derived simplified 
expressions for the Kramers-Bell kinetic rates. To validate the theoretical predictions 
we have carried out experiments on a specifically designed DNA sequence that displays 
cooperative two-state behaviour. According to theory, this sequence has a free energy 
landscape characterized by two states (folded and unfolded) that are separated by a 
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Figure 11. Histograms of the values of AGo obtained from kinetics (table[2]) and the 
fluctuation relation (table [3|) . The vertical dashed line indicates the best estimate for 
AGo obtained from kinetics (black), the vertical dotted line indicates the best estimate 
obtained from the fluctuation relation (red) and the vertical continuous line the value 
predicted by Mfold [23] (blue) at the experimental conditions (23° G and IM NaCl). 



single barrier and a transition state located in a position along the molecular sequence 
that is independent on the applied force. By doing rupture force measurements we are 
capable of predicting the main parameters that characterize the free energy landscape 
of the hairpin, such as: the distances of the folded and unfolded states to the transition 
state (x^, x^^), the free energy difference between both states (AGi) and the coexistence 
rate (kc)- By measuring the mechanical work and using the Crooks fluctuation relation 
we can also extract the reversible work in our experiments. Both type of measurements 
(rupture force kinetics and the fluctuation relation) yield values for the free energy of 
formation of the hairpin at zero force, AGq, that are compatible with each other and 
with the calorimetric based prediction by Mfold. We remark the following results: 

• Validity of the Kramers— Bell simplified rates. The low value of the released 
molecular extension Xm obtained in kinetic studies suggests that the Kramers-Bell 
model described in section [3] is an over- simplification of the true folding/ unfolding 
kinetics of the hairpin. It provides estimates for the kinetic parameters that fit the 
expected values obtained from theory within 15%, but cannot do better. Another 
possible explanation for such lower values of Xm is molecular fraying at the beginning 
of the hairpin stem that reduces the molecular length and free energy of the hairpin. 
Given the complexity of the molecules we are investigating, a 15% agreement 
between theory and experiment can be considered reasonably good. Yet it would be 
desiderable to explore different sequences and molecular constructions and develop 
models that can improve the agreement between theory and experiments. 
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• Free energy recovery. A noticeable result from our experiments is the strong 
variability observed for the parameters extracted from rupture force kinetic 
studies for different molecules pulled at different speeds (table [T] and figure [H]). 
Variability between parameters is an inherent aspect of single molecule experiments. 
Concerning the free energy of the hairpin AGq, and as shown in figure [TT], the 
fluctuation relation by Crooks is compatible with the estimate obtained from 
rupture force kinetics. Both values are 2-3kcal/mol below the value predicted 
by Mfold leading to a discrepancy between 5-10%. Yet, the fluctuation relation 
provides less spread values and more reliable final estimates for AGq. Many sources 
of error can account for such discrepancy: limitations of the nearest-neighbour base 
pair model used to describe the thermodynamics of unzipping as well as uncertainty 
in the Mfold nearest neighbour free energies; the innacurate knowledge of the elastic 
properties of the released ssDNA; the innacurate determination of the diameter of 
the hairpin; molecular fraying and; force calibration and other experimental errors. 

• The diameter of the hairpin do and the elastic properties of the ssDNA. 

The value of do (— 2 nm) taken from structural studies of the double helix, and the 
polymer model used to model the elastic response of the ssDNA, are particularly 
important for correctly estimating the value of AGq- For example, a 20% error 
in the value of do introduces a 1 kcal/mol error in AGq (— 1.6 k^T at room 
temperature). How reliable is our value for do? How accurate are the parameters 
of the freely jointed chain (Kuhn length. Young modulus and interbase distance) to 
describe the elastic properties of the ssDNA released by the hairpin? It would 
be much interesting to carry out similar detailed investigations in other DNA 
sequences. In this way we could check whether the values we adopted for these 
parameters are generically accurate for arbitrary hairpin sequences (or instead they 
depend on the DNA sequence). 

Two-state models are very useful to address questions related to thermodynamics 
and kinetics of force induced transitions. They guide us in validating and detecting 
limitations of current theories and models describing the folding of nucleic acids 
and proteins. Two-state models are also useful to investigate issues related to the 
irreversibility and dissipation of nonequilibrium small systems as reported in our 
companion paper [48]. Finally, the current investigation is a first step to approach 
the force-induced folding/unfolding kinetics of more complex molecular structures. 
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Figure Al. Schematics of the mixed ensemble. The total distance X is expressed as 

the sum of the different extensions x\,,Xh,Xn- x-t = x^^ + x^^ is the total length of 
the handles, while the extension Xn has been defined in ([2|). The hairpin diameter do 
is taken to be equal to 2 nm and must be included in x„ for all configurations of the 
hairpin with the exception of the fully unfolded one. 



Appendix A. Free energy landscape in the mixed ensemble 

In section |3] we assumed that the experimentally controlled variable is the external force. 
However, this is far from true in single molecule experiments with optical tweezers or 
atomic force microscopy, where the force is a fluctuating variabl^. Is it possible to 
define and compute free energy landscapes beyond the force ensemble worked out in 
section [3]? Here we show how to extend the concept of free energy landscape to the 
mixed ensemble case relevant for our pulling experiments. The presentation here is 
summarized, the interested reader will find details in [33]. 

A schematic representation of the relevant experimental setup corresponding to 
the mixed ensemble is shown in figure lAli Let X, x„, Zb, denote the trap-pipette 
distance, the molecular extension of the hairpin ([2]), the bead position in the trap and 
the handles total extension, respectively. These satisfy X = Xb + Xh + a;„. The force 
is given by / = fcb^^b, where kh is the stiffness of the trap. In the mixed ensemble, the 
molecular extension of the hairpin plus handles = x„ + Xh) and the force (/) are 
fluctuating variables and only X is externally controlled. Similarly to what we did in 
section [3l let G{in,X) denote the free energy necessary to break the first n base pairs 
along the hairpin (starting from the beginning of the fork), thus generating a molecular 
extension (bead-to-bead) when the trap-pipette distance is equal to X. We can write 

G{L, X) = G{L, 0) + Gstrctch(0 ^ in, X) , (A.l) 

II Exceptions arc magnetic tweezers [49] or specifically designed tweezers setups with zero stiffness 
regions [50]. Force feedback systems are not ideal constant force systems as they introduce other sort 
of noise effects due to the limited feedback frequency (typically around 1 kHz). 
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where Gstretch(0 — ^ in] X) gets contributions from the bead, the handles and the ssDNA 
released by the hairpin: 

Gstrctch(0 ^ In, X) = Gb(0 ^ Xb; X) + Gh(0 ^ x^; X) + Gssdna(0 x'"; X) . (A.2) 
The initial condition X = in the term G{in, 0) in ( lA.ip must be understood as that 



position of the trap where all elements are fully relaxed and subject to zero tension. 
The different terms in ( 1A.2I1 are given by 

Gb(0 Xb; X) = r FMdy , (A.3) 

Jo 

Gh(0 ^ Xh; X) = r F^{y)dy , (A.4) 

JO 

GssDna(xo ^ x'"; X) = / F4"DNA(y)dy - F^^j,^^{xn){1 - 5n,7v) , (A.5) 

Jo 

where Fb{y) = k^y is the elastic response of the bead, Fi,{y) stands for the equilibrium 
force-extension curve for the handles and -Fss"dna(2/) is the equilibrium force-extension 
curve for the ssDNA of contour length /„. The last term in ( 1A.5P accounts for the 
shortening of the molecular extension equal to the diameter of the hairpin that occurs 
when the last base pair of the hairpin unzips. The contour length in ( 1A.5P satisfies 
the mechanical equilibrium conditions, 

/ = Fb(xb) = Fh(xh) = F,'^bNA(a^'") , X = Xb + Xh + x'" +do(l -<7v), (A.6) 

defining also the Lagrange multiplier / (corresponding to the average of the instantane- 
ous and fluctuating force acting on each element). Note that for a given pair X) we 
have 3 unknowns (xb, Xh, x„). The three independent equations in (1A.6P fully determine 
the system so we can exactly compute G{£n, X) as a function of in for a given value of 
X. For that we must know the elastic response of the different elements: Fb, F^, -FgsDNA- 

Appendix B. Dependence of AGi(/) across the transition 

Here we show that terms of the type GssDna(0 — > x'^; /) entering in ( ITOi) . ( |TTi) vary with 
/ much less than the corresponding product fx^ does. In other words, 

c^Gssdna(0 — * x'^: f) p ^dx^ , 
gj^ —<^x+f^, (B.l) 

where we assume that x^ generally depends on / (see section ISTT]) . We start from (jl]), 
([T]) and write 



. „-GssDna(0 ^ x'- /) = / ^sm^dy + /-- , (B.2) 
of Jo df df 

where -FsIdna(2/) the integrand depends on / through the dependence of the contour 
length as given in ([T]). The elastic response of biopolymers (e.g. ssDNA) satisfy either 
the worm-like chain model or the freely jointed chain model. In both cases we have that 
the force is sole function of the extension divided by the contour length, 

^ssDNa(Z/) = ^ssDNA(y//) . (B.3) 
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It is also reasonable to assume that the relative change in the contour length Ip of the 
released ssDNA is at most equal to the relative change of the distance x^, 
Idl. 1 dx^ 



From (IB.Sp . (]B.4p we can write 



df dh df - x^ dy df ' ^ ' ' 

Inserting this expression in the integrand in ( IB. 21) and doing an integration by parts we 
obtain 

r^a,,_^^(;.._£Vi..(.)a.). (B.e) 

Plugging this expression into ( 1B.2I) we get 

gGssDNA(O^X^;/) 1 dx^ f^' , dx^ p dx^ 

where we introduced the upper bound -FssDna(i/) ^ / iii the integrand. The last 
inequality is generally valid for short hairpins that are characterized by a rigid ssDNA 
where, according to ( IB. 31) . 

df f 

Therefore we proved (]B.1|) . The same demonstration applies for the term GssDna(0 — >■ 
2^'^; /)) proving that we can approximate (116 ap . ( 11661) by the simplified rates (ITSl) . 

Appendix C. Reversible work in the mixed ensemble 

Here we show which terms contribute to the experimentally measured reversible work 
AG^™!"' appearing in the Crooks fluctuation relation (|35|) . From the definition of 



G{x,X) in Appendix A we can write 

^^xZZ ~ ^i^N^^J -^max) — G{Xo, X^in) , (C-l) 

where x™^^ is the molecular extension of the fully unfolded hairpin at X = Xmax- From 



(IA:T|) . (1X21) . (1X3]) ■ (1X4]) and (IXSl) we obtain 



„max „^max 



-f^^^do + AGo (C.2) 



where AGq was defined in ([8]) and where we have introduced the equilibrium force at 
the end of the stretching cycle at Xmax, /max = -^s'sDnaI^™'^'')- Note that the correction 
term /maxC^o induced by the finite diameter of the hairpin has a different sign in the 
force ensemble ([6]). The reason is that, in the force ensemble, an increase in the total 
distance X of the system (at fixed force /) decreases the total free energy of the system. 
However, in the mixed ensemble, the same increase in the total distance X (followed by 
the corresponding drop in the force /) increases the total free energy of the system. 
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We now introduce the force-dependent rigidity of the handles, 

W) - (^) . (C.3, 

and use flA.6P to change all integration variables in flC.2p to the Lagrange multiplier 
(force) /. We finally get 

^ /-/max fdf /-/max f^f px'^'^^ 

^GtZ = V- + / F77T + / ^ssDNA(y)dy - /maxrfo + AGo , (C.4) 

where we have written /max = -^s'sDnaI^jv^'')- If ^he handles are very rigid compared to 
the rigidity of the trap then ^ and therefore the sum of the two integrands in the 
rhs of ( 1C.4[) can be approximated by 
1 1 1 



kes ' k],{f) ' ^^'^^ 

where fcefr is taken as constant in the interval [/min, /max]- The condition k-^ ^ /cb is 
reasonably well satisfied in our experiments where we use 29-bp handles. Therefore, by 
substituting ( 1C.5I) and defining 

f /max f (\f f /max fdf 



AGi(/max) = AG^- - / 

J f- 



/min ^ /min ^h(/) 

AGJX - ^ ((/max)^ - (/mm)^) , (C.6) 



eff 



we get 

^max 

AGi(/max) = / " i^sDNA(y)dl/ - /maxrfo + AGq , /max = i^s'sDNA (a^^^) • (C.7) 

Note that the expression (]C.6P for AGi(/jnax) is the equivalent of (fTOj) in the force 



ensemble. According to ( 1C.7I) it only depends on the maximum force at which the 
molecule is unfolded, therefore we explicitely indicate such dependence of AGi in its 
argument. Using flC.6l) . flC.7p it is possible to extract AGq from the knowledge of 



AGx^Zi /min, /max and the clastic properties of the ssDNA: 

AG^r = ^ ((/max)' - (/min)') + AGi(/max) (C.8) 
AGq = AG^™'''' — — \- — ^ GssDNa(0 — > Xn; /max) + /maxC^O • (C.9) 

References 

[1] Ritort F 2006 J. Phys.: Condens. Matter 18 R531 | cond-mat/0609378] 
[2] Hormefio S and Arias- Gonzalez J R 2006 Biol. Cell 98 679 
[3] Smith S B, Cui Y and Bustamante C 1996 Science 271 795 

[4] Wang M D, Yin H, Landick R, Gelles J and Block S M 1997 Biophys. J. 72 1335 
[5] Dessinges M-N, Maier B, Zhang Y, Peliti M, Bcnsimon D and Croquette V 2002 Phys. Rev. Lett. 
89 248102 

[6] Rief M, Gautel M, Oesterhelt F, Fernandez J M and Gaub H E 1997 Science 276 1109 
[7] Liphardt J, Onoa B, Smith S B, Tinoco I Jr and Bustamante C 2001 Science 292 733 



Dynamic force spectroscopy of DNA hairpins. I. 



28 



Fernandez J M, Chu S and Oberhauser A F 2001 Science 292 653 

Leger J F, Robert J, Bourdieu L, Chatenay D and Marko J F 1998 Proc. Nat. Acad. Sci. 95 12295 
Yin H, Wang M D, Svoboda K, Landick R, Block S M and Gcllcs J 1995 Science 270 1653 
Finer J T, Simmons R M and Spudich J A 1994 Nature 368 113 
Noji H, Yasuda R, Yoshida M and Kinosita K Jr 1997 Nature 386 299 
Strick T R, Croquette V and Bensimon D 2000 Nature 404 901 

Abbondanzieri E A, Greenleaf W J, Shaevitz J W, Landick R and Block S M 2005 Nature 438 
460 

Wen J-D. Lancaster L, Hodges C, Zeri A-C, Yoshimura S H, NoUer H F, Bustamante C and Tinoco 

I Jr 2008 Nature 452 598 
Woodsidc M T, Anthony P C, Behnke-Parks W M, Larizadeh K, Herschlag D and Block S M 2006 

Science 314 1001 

Woodsidc M T, Belmke-Parks W M, Larizadeh K, Travers K, Herschlag D and Block S M 2006 

Proc. Nat. Acad. Sci. 103 6190 
Bustamante C and Smith S B 2006 US Patent 7, 133,132, B2 
Smith S B, Cui Y and Bustamante C 2003 Methods in Enzymology 361 134 
Ritort F 2008 Adv. Chem. Phys. 137 31 
SantaLucia J Jr 1998 Proc. Nat. Acad. Sci. 95 1460 

SantaLucia J Jr and Hicks D 2004 Annu. Rev. Biophys. Biomol. Struct. 33 415 
Zuker M 2003 Nucleic Acids Res. 31 3406 

Cocco S, Monasson R and and Marko J 2001 Proc. Nat. Acad. Sci. 98 8608 
Cocco S, Monasson R and and Marko J 2002 Phys. Rev. E 65 041907 
BeU G I 1978 Science 200 618 

Tinoco I Jr 2004 Annu. Rev. Biophys. Biomol. Struct. 33 363 
Manosas M, Junier I and Ritort F 2009 Phys. Rev. E to be published 
Leffer J E 1953 Science 117 340 

Hyeon C and Thirumalai D 2005 Proc. Nat. Acad. Sci. 102 6789 
Hyeon C and Thirumalai D 2006 Biophys. J. 90 3410 

Manosas M, Collin D and Ritort F 2006 Phys. Rev. Lett. 96 218301 |cond-mat/060625^ 



Manosas M and Ritort F 2005 Biophys. J. 88 3224 [cond-mat/0405035 
Evans E and Ritchie K 1997 Biophys. J. 72 1541 
Schlierf M and Rief M 2006 Biophys. J. 90 L33 
ShiUcock J C and Seifert U 1996 Phys. Rev. E 57 7301 
Hummer G and Szabo A 2003 Biophys. J. 85 5 



Gerland U, Bundschuh R and Hwa T 2001 Biophys. J. 81 1324 [cond-mat/0101250 
U. Gerland U, Bundschuh R and Hwa T 2003 Biophys. J. 84 2831 |co nd-mat/020820g 
Wen J D, Manosas M, Li P T X, Smith S B, Bustamante C, Ritort F and Tinoco I Jr 2007 Biophys. 
J. 92 2996 

Manosas M, Wen J D, Li P T X, Smith S B, Bustamante C, Tinoco I Jr and Ritort F 2007 Biophys. 
J. 92 3010 

Hyeon C, Morrison G and Thirumalai D 2008 Proc. Nat. Acad. Sci. 105 9604 
Crooks G E 1999 Phys. Rev. E 60 2721 cond-ma t/9901352, 

Collin D, Ritort F, Jarzynski C, Smith S B, Tinoco I Jr and Bustamante C 2005 Nature 437 231 
Bennett C H 1976 J. Comp. Phys. 22 245 

Shirts M R, Bair E, Hooker G and Pande V S 2003 Phys. Rev. Lett. 91 140601 

Huguet J M, Forns N, Bizarro C V, Smith S B, Bustamante C and Ritort F (unpublished). 

Our companion paper, Manosas M, Mossa A, Forns N, Huguet J M and Ritort F 2008 Dynamic 

force spectroscopy of DNA hairpins. II. Irreversibility and dissipation. 
Gosse C and Croquette V 2002 Biophys. J. 82 3314 

Greenleaf W J, Woodsidc M T, Abbondanzieri E A and Block S M 2005 Phys. Rev. Lett. 95 208102 



